ACC-SVM: Accelerating SVM on GPUs using OpenACC

نویسندگان

  • Rengan Xu
  • Dounia Khaldi
  • Abid M. Malik
  • Barbara Chapman
چکیده

GPUs have been successfully applied in scientific computing in the last decade. Many machine learning algorithms have also used GPUs to accelerate their computations. This includes the Support Vector Machine (SVM) which is a classical machine learning algorithm that has been successfully used in many applications such as text classification and image recognition. There have been many open-source CUDA SVM implementations. However, CUDA versions of SVM are not portable and difficult to maintain or redesign. Porting SVM to a directive-based portable model like OpenACC will make it possible to target multiple accelerators. In this paper, we use OpenACC programming model to parallelize SVM and produce ACC-SVM. Since the high-level programming model simplifies the programming by sacrificing performance, there is a performance gap between the OpenACC and the CUDA SVM versions. In order to improve the performance of ACC-SVM, we apply our auto-tuning framework to decrease the gap between the CUDA and the OpenACC performance results. The performance difference between the optimized ACC-SVM resulting from the auto-tuner and the CUDA SVM is 15.58% for the kernels code and 7.87% with respect to the whole application. For many applications this loss of performance is more than made up for by the convenience and portability of the high-level approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison

Phylogenetic inference is used to derive a “tree of life” for a collection of species whose DNA sequences are known. While several software packages have already been developed to take advantage of GPUs to accelerate phylogenetic inference, they typically require significant changes to the original code, constraining code maintenance. Recently, the OpenACC API was proposed to minimize the progr...

متن کامل

A portable platform for accelerated PIC codes and its application to GPUs using OpenACC

We present a portable platform, called PIC ENGINE, for accelerating Particle-In-Cell (PIC) codes on heterogeneous many-core architectures such as Graphic Processing Units (GPUs). The aim of this development is efficient simulations on future exascale systems by allowing different parallelization strategies depending on the application problem and the specific architecture. To this end, this pla...

متن کامل

Accelerating high-order WENO schemes using two heterogeneous GPUs

A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...

متن کامل

Accelerating lattice QCD simulations with 2 flavours of staggered fermions on multiple GPUs using OpenACC - a first attempt

We present the results of an effort to accelerate a Rational Hybrid Monte Carlo (RHMC) program for lattice quantum chromodynamics (QCD) simulation for 2 flavours of staggered fermions on multiple Kepler K20X GPUs distributed on different nodes of a Cray XC30. We do not use CUDA but adopt a higher level directive based programming approach using the OpenACC platform. The lattice QCD algorithm is...

متن کامل

Lattice Simulations using OpenACC compilers

OpenACC compilers allow one to use Graphics Processing Units without having to write explicit CUDA codes. Programs can be modified incrementally using OpenMP like directives which causes the compiler to generate CUDA kernels to be run on the GPUs. In this article we look at the performance gain in lattice simulations with dynamical fermions using OpenACC compilers.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016